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[57] ABSTRACT 

A character recognition apparatus is arranged such 
that: a character pattern is extracted as a rectangle from 
the inputted picture image data; the number of picture 
element points corresponding to the direction code of 
the boundary point of the character portion is provided 
as the boundary direction density for each region.of the 
recognition object character pattern for each sub-region 
provided by the division of character pattern; the scan- 
ning operation is performed with respect to the side 
opposite respectively from four sides of the extracted 
rectangle; the picture clement point which changes 
from the background to the character portion is defmed 
as a change point; the picture element string number is 
increased by one each time the change point is detected 
so as to add the picture string number to each picture 
element; the number of the picture element point corre- 
sponding to the picture element string number is pro- 
vided as the background density for each region of the 
recognition object character pattern for each sub-region 
to be divided; the character recognition is performed 
using the boundary direction density for each region 
and the background density for each region. 

8 Claims, 8 Drawing Sheets 
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Fig. 9(a) 
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tion for extracting the recognition object character 

CHARACTER RECOGNmON APPARATUS region from the picture image memory portion, a direc- 

tion code designating portion for designating the direc- 
This apphcauon is a contmuation of now abandoned tion code of the picture element point of the recognition 
Ser. No. 829,992. filed Feb. 18. 1986. 5 object character region through its relationship with 

BACKGROUND OF THE INVENTION picture element points adjacent to the picture element 

_ ... ,. point, a direction density calculating portion for obtain- 

The present mvenflon relates to a character recogm- ing the number of picture element points corresponding 
uon apparatus which recogtnzes the types of newspa- to the direction code by the use of the dilution code of 
pers, magazme or the hke. and the hand-wntten charac- 10 the picture element pomt obtained by the direction code 
ten so as o convert them mto the character codes. g^^^ ^rtion. a change point detection portion for 
Conventionally m a character r«:ogmt,on apparatus. detecting, as a change ^)int. a picture element pdn 

ch«««Tf±''^^ •rj"^;*"-®*^^"?^ ^l^^'h =^8es into I character ^portion from a b^ck! 

character from the image data inputted from a seamier otvm,«^ «/^r+;«„ * * j i.^ t. 7 

or the like, and obtainini the feat^es of the recognition 13 ^ I T'' ^^^^^^^^j^ 

object character, and obtaining the distance be^een "^^^'^^^^^^^^ 

the features of a recognition candidate character group foTcS^^^^^^ ^"""'Vu fu'^T^ P"'''^" 

registered in advancelnd the features of the recognition hST^^^^^ f^^' 

object character so as to have, as a recognition result, T "t^^^^^^"^' PJ^*^^^ elements until 

the closest character from among the recognition candi- 20 ^^^^^^^ pomts are sequentially detected with respect 
date character group. *° scannmg direction m the change point detepting 

The construction of the conventional example is recognition portion for recognizing the can- 

shown in FIG. 11. The picture image provided through character usmg the number of picture element 

the scanning of the manuscript from the scanner 21, obtamed by the durection density calculating 

which is also referred to as an image input portion, U 25 P^^!°° number of the picture element points 

stored in the picture image memory portion 22. In the oo^^ned by the background density calculating portion, 
character extracting portion 23, the recognition object According to the present invention, the correct rec- 
character pattern is extracted as a rectangle from the ignition may be performed with respect to the charac- 
picture image data stored in the picture image memory *f " similar in shape such as "jl'. ""jj* which conven- 
portion 22. In the direction code designating portion 24, 30 tonally cause recognition errors, 
when the polarity of the picture element point of the . F^ifthermore, the recognition object character region 
recognition object character pattern is different from ^ divided into a plurality of sub-regions to calculate the 
the polarity of at least one of eight picture points adja- region-separation direction density and the region-sepa- 
cent to the picture element point, the picture element ration background density for each sub-region so that 
point is considered as a boundary point, and the direc- 35 correct recognition may be performed even for 
tion code of the boundary point 11 is determined similar characters which are only partially different, 
through its relationship with the picture element point Also, the sub-region used in obtaining the direction 
adjacent to such a boundary point 11 as shown in FIG. density for each region and the sub-region used in ob- 
2(b), the number of the boundary points corresponding taining the background density for each region are 
to each direction code is obtained as the features show- 40 f"^^^ same in the division region so that the process- 
ing the boundary direction density of the character. c^ sub-region division may be simplified. 

In the recognition portion 25, the distance between Th^ extracting of the recognition object character 
the features of the recognition object character ob- region through the rectangle by the character extract- 
tained by the direction code granting portion 24 and the portion and the division of the recognition object 

features of the recognition candidate character group 45 character region into horizontal and vertical directions 
registered in advance in the dictionary 26 is calculated, so as to determine the sub-region enable the region to be 
and the character closest in the distance in the recogni- simply expressed and enable the region to be simplified, 
tion candidate character group is the recognition result. The direction code may be obtained in the boundary 

In the conventional example, the boundary direction point where the polarity of the picture element point of 
density of the recognition object character "jtl' shown 50 the character pattern of the recognition object charac- 
in FIG. 7(a) and the boundary direction density of the ter region is different from the polarity of at least one 
recognition object character *i-jr^' shown in FIG. 7(Z>) picture element point selected among the picture ele- 
agree as in FIG. 12 so as to cause an erroneous recogni- ment points adjacent to the picture element point so that 
^i°"- the processing of the picture element point except for 

SUMMARY OF THE INVENTION boundary point is simplified and the direction den- 

^v^yiAKY u^ 1 Hb, INVENTION sity g^^jj ^^^^^ calculated stably indepen- 

Accordingly, an object of the present mvention is to dently of the thickness of the character, 
provide a character recognition apparatus which is The change point is detected through the recognition 
capable of correctly recognizing even characters which object region from four directions, by the change point 
are similar in shape. eo detecting portion so that characters similar in shape 

To achieve the above object, the present invention may be recognized correctly by simple processing, 
provides a character recognition apparatus for perform- 
ing recognition through the joint use of the direction BRIEF DESCRIPTION OF THE DRAWINGS 
density and the background density , and which is pro- These and other objects and features of the present 
vided With a picture image input portion for inputting 65 invention will become apparent from the following 
the picture images mcluding the recognition object description taken in conjunction with the preferred 
characters, a picture image memory portion for storing embodiment thereof with reference to the accompany- 
the mputted picture images, a character extracting por- ing drawings, in which: 
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FIG. 1 is a block diagram of a character recognition 
apparatus in accordance with one example of the pres- 
ent invention; 

FIGS. 2(j)-2(c) are illustration views of how to ob- 
tain the direction code of the boundary point of the 
character; 

FIG. 3 is a chart for illustrating the character extract- 
ing flow; 

FIG. 4 is a chart for illustrating the direction code 
designating flow; 

FIG. 5 is a chart for illustrating the picture element 
string number designating flow; 

FIG. 6 is a view for illustrating how to obtain the 
picture element string number, 

FIGS. 7(a)-7(6) are pattern views showing the recog- 
nition exercise characters "-j-" , ^-^-Jj-'; 

FIGS. 8(fl)-8(6) are pattemviews, in which the " 
**"ir' ^ accordance with one example of the present 
invention are divided into the sub-region; 

FIGS. 9{a)-9{b) are corresponding charts showing 20 
the number of the boundary points corresponding to 
each direction code for each sub-region of the "-|-^* , 
in the embodiment; 

FIGS. 10(a)-10(6) are corresponding charts showing 
the number of the picture elements corresponding to 25 
each picture element string number for each sub-region 
of the *-4;;" in the embodiment; 

FIG. 11 is a block diagram of a character recognition 
apparatus of a convention system; and 
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15 



the respectively facing sides from four sides, top, bot- 
tom, right, left of the extracted rectangle provided by 
the character extracting portion 3. When the picture 
element point which changes from the background 
ponion to the character portion of the recognition ob- 
ject character has been detected is as shown in FIG. 6. 
the picture element is regarded as a change point, and 
the picture-element string number which is the number 
of the group to which the picture element point belongs 
is increased by one so as to serve as the picture element 
string number of the picture element point. In the case 
of the picture element points other than change points, 
a picture element string number which is the same as 
that of a picture element point of the previous picture 
element point is provided. A background density calcu- 
lating portion for each region 7 obtains the number of 
picture element points corresponding to each picture- 
element string number using the picture-element string 
number of the picture element points provided by the 
picture-element string number designating portion 6 for 
each sub-region of PXQ, wherein the rectangle ex- 
tracted by the character extracting portion 3 is P- 
divided in the horizontal direction, and is Q-divided in 
the vertical direction. A recognition portion 8 respec- 
tively obtains features showing the boundary direction 
density of the character provided from the boundary 
direction density calculating portion for each region 5, 
and the features showing the background density of the 
character obtained from the background density calcu- 



FIG. 12 is a view showing the boundary direction 30 lating portion for each region 7, and the features show- 



35 



density of the character pattern shown in FIGS. 7(fl- 
yi(b) obtained by the conventional system, 

DETAILED DESCRIPTION OF THE 
INVENTION 

Before the description of the present invention pro- 
ceeds, it is to be noted that like parts are designated by 
like reference numerals throughout the accompanying 
drawings. 

Referring now to FIG. 1, there is shown a construe- 40 
tion view in one embodiment of a character recognition 
apparatus in accordance with the present invention. A 
picture image input portion 1 inputs images including 
recognition object charaters and stores the inputted 
images in a picture image memory portion 2. A charac- 45 
ter extracting portion 3 extracts a recognition object 
letter pattern in a rectangular form from the picture 
image memory portion 2. A direction code designating 
portion 4 designates the picture element point as a 
boundary point when the polarity of the image element 50 
point of the recognition object character pattern which 
has been extracted by the character extracting portion 3 
is different from the polarity of the picture element 
point of at least one picttxre element point selected from 
among eight picture element points adjacent to the 55 
picture element point; the direction code of the bound- 
ary point is determined by the relationship with eight 
picture element points adjacent to such a boundary 
point as shown in FIGS. 2(a) and 2(c). A boundary 
direction density calculating portion for each region 5 60 
obtains the nimiber of boundary points corresponding 
to each direction code using direction code of the 
boundary points provided by the direction code desig- 
nating portion 4 for each sub-region of MxN wherein 
the rectangle extracted by the character extracting por- 65 
tion 3 is M-divided in the horizontal direction, and is 
N-divided in the vertical direction. A picture-element 
string number granting portion 6 scans with respect to 



ing the boundary direction density of all of the charac- 
ters which become objects of recognition stored in the 
dictionary 9, and the similarity to the features showing 
the background density so as to extract the high similar- 
ity characters as a recognition result. A display ponion 
10 shows the recognition result extracted by the recog- 
nition portion 8. 

The processing flow of a character extracting portion 
3 of one embodiment of the present invention is shown 
in FIG. 3. 

Image data stored in the picture image memory 2 are 
scanned in the character string direction, and the histo- 
gram for each scan line is coimted; the start position is 
an n picture element or more in the value of the histo- 
gram and the completion position are used to determine 
the character string portion; the string portion charac- 
ter is further scanned vertically in the character string 
direction to count the histogram for each scan string so 
as to detect the start portion of the character pattern 
portion and the completion position which is an m pic- 
ture element or more in the value of the histogram is 
used to determine the character pattern so as to extract 
the recognition object character pattern. 

The processing flow of the direction code designat- 
ing portion 4 in one embodiment of the present inven- 
tion is shown in FIG. 4. 

When the polarity of the picture element point P(i, j) 
of the recognition object character pattern extracted by 
the character extracting portion 3 is different from the 
polarity of at least one pictue element point selected 
from among 8 picture element points adjacent to the 
picture element point, as shown in FIG. 2{b), P(i— 1, 
j-1), P(i-1, j). P(i-1, j-f 1), P(i, j-1), P(i, J+1), 
P(i+1, j-l), Pi+1, j), P(i+1, j + 1), the picture ele- 
ment point is designated as a boundary point. The direc- 
tion code of the picture element point is determined 
from its relationship with the eight picture element 
points adjacent to the boundary point as shown in FIG. 
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2(c). This is executed about all picture elements of the object character patterns for the two respective charac- 
recognition object character pattern. ters from the picture image memory portion 2. In the 

In the boundary direction density calculating portion direction code designating portion 4, the direction 
for each region 5, the rectangle extracted by the charac- codes of all of the boundary points of the recognition 
ter extracting portion 3 is M-divided in the horizontal 5 object character pattern are decided in accordance with 
du-ecuon, and is N^ividedm the vertical direction. The the conditions of FIGS. 2{a)-2{d). In the boundary 
number of the bound^ pomts is obtained, for each direction density calculating portion for each region 5, 
sub-region of the MXN. m accordance with each direc the number of the boundary points corresponding to 
don code by the use of the direcuon code of each pic- each direction code are obtained using thrdirection 
ture element pomt of the recogmUon object character lo code of the boundary point obtained in the direction 
pattern provided by the direction code grantmg portion ^ode designating portion 4 in each sub-region of sixteen 

* The process flow of the picture element string num. ^h^'^ J^^t^ in the horizontal direction, and 4- 
bergraStingportion6isshowninFia5.ThescLnng f^^^^J^'^ 'W.?Fir^^ 
operation is performed with respect to the side opposite 15 ^'^J^ *. 'ctz-'c'A.V J nll^ ^W-m- The 
from the left-hand side of the e!uracting rectangle pro- ^^^^ ^^P^^*^^^ 

vided by the character extracting portion 3. As shown ^1^^?" boundary pomts correspondmg to each 
in FIG. 6, the picture element point which changes into direction code m each sub-region of the sixteen of the 
the character portion from the background portion of recogmtton object character patterns respectively 
the recognition object character is detected to deter- 20 ^ '^^^^"'^^^^ ^® features showing 

mine the picture element point as a change point. The ^® boundary dir«:tion density of the recognition exam- 
picture element string number which is a group number P^® characters **±^\ " J^'. In the picture element string 
to which the picture element point belongs is increased number designating portion 6, the scanning operation is 
by one so as to have the picture element string number performed with respect to the respective opposite sides 
of the picture element point. When it is not a change 25 ^^P* bottom, right, left of the rectangle R 

point, a picture element string number of the previous shown in FIGS. 7(a)-7(6) to determine the picture ele- 
picture element point is designated. A similar process- m^nt string number for each scanning operation of all of 
ing is further performed from the top side, the right the picture element points in the recognition object 
side, the bottom side. character pattern in accordance with the conditions 

In the background density calculating portion for 30 shown in FIG. 6. In the background density calculating 
each region 7, the rectangle extracted by the character portion for each region 7, the number of picture element 
extracting portion 3 is P-divided in the horizontal direc- points corresponding to each picture element string 
tion, and is Q-divided in the vertical direction. The number in each sub-region is obtained using the picture 
number of picture elements corresponding to each pic- element string number of each picture element point 
ture element string number is obtained, for each sub- 35 obtained by tiie picture element string nuinber granting 
region of PxQ, by using the picture element string portion 6. The values shown in FIGS. 10(a)-10(6) are 
number of each picture element point in the respective the respective number of picture elements correspond- 
scanning from the left side, the top side, the right side, ing to each picture element string number in each sub- 
and the bottom side obtained from the picture element region of the sixteen in a case where the recognition 
string number designating portion 6. ^ object character patterns respectively shown in FIGS. 

In the recognition portion 8, with the factors of fea- l(a)-l(b) have been scanned from the left side. Simi- 
tures Ck (k= I, 2, . , . K) expressing the boundary direc- lariy, the scanning operation is performed from each 
tion density of the character obtained by the direction side of the top, bottom, right to provide the features 
density calculating portion for each region 5 and fea- showing the background density of the recognition 
tures Bl 0=1. 2, . . , L) expressing the background ^5 example characters "4^\ *M- . The dictionary 9 stores, 
density of die character obtained by the background for aU of the recogmuon candidate characters, the fea- 
density c^culatmg portion for each region 7, ^e tures showing the boundary direction density and the 
boundary dirwtion density Ctk background density Btl features showing the background density. In the recog^ 
(t = I, 2. . . . -n of the t-th recognition candidate charac- nition portion 8, the similarity is respectively obtained 
ter stored m the dictionary 9 m advance, the distMceDt between the the recognition example characters "f", 
with r^pect to Ck and Bl m connection with Ctk and ^t^. and the characters within the dictionary 9 abtlit 
Bti is obtamed by the followmg equation: the features on the boundary direction density and the 

background density to extract the most similar charac- 
« f \Ck-Vtk\ + 2 1 5/ - Bti\ ^ ^ recognition result. In the display portion 10, the 

*=! ' /=i ' ' 55 recognition result is displayed. 

In the present embodiment, the number of picture 

Furthermore, Dt is obtained for t= 1, 2, , . . T, and the element points corresponding to all of the picture ele- 
smallest value Ds in the Dt is extracted so that the S-th ment string numbers is obtained for all of the sub- 
character becomes the recognition result regions of the recognition object character pattern. 

The recognition example characters "J^", will 60 Needless to say, the number of the picture element 
be described by way of example about the operation of points corresponding to the restricted picture element 
the character recognition apparatus constructed as de- string number in the restricted sub-region may be used 
scribed hereinabove. as the features in recognizing. Also, in the present em- 

In the picture image input portion 1, the recognition bodiment, the picture element point which changes into 
example characters "j^', *^ are digitized and are 65 the character portion from the background portion has 
stored in the picture image memory portion 2. In the been detected as a change point when the respective 
character extracting portion 3, the rectangle patterns R scanning operations have been performd from the left 
shown in FIGS. 7(a)-7(£i) are extracted as recognition side, the top side, the right side and the bottom side in 



09/03/2004, EAST version: 1.4.1 



4,813,078 

3D1DD , . , 7,. ... ^? . 

ODOD DDOaEDT DADDDiDDDion DnDo: EDODiners ADDOQDDeD SeOrDD TOODDEASTDl. 4 . IDD 

DDDD DGDDEDT DDsernDDe: DDenDerDD 

DDDD SET DDDATTODDDoDADODlDDDenDerOa 

DDDQ SET 0DDATTDDDDoDADDD2a0Sl0993D 

DDDD SET D0DATTDDDDODADDD30DSDTDD 

DDDD SET DDDATTDDDDoDAaDD4D2004090310213D0 

□DDO SET DETODEDDDD 

DDDD SET DDODEDDDDD 

DDDD SET DDTODDDDDDED 

DDDD SET DDDDSDDDDDE 

DDDD SET DADEDDDETTED 

DDDD SET DDDDDOOD 

DDDD SET OESDDDTDDDDDOO 

DDDD SET DDTSDEDDDDEDDl 

DDDD EDTED DADDDADEDDDD 



